Query Expansion by Pseudo Relevance Feedback

Author

  • Zheyun Feng
Abstract

In this document, we describe our algorithms for automatic query expansion. The algorithms are implemented in Java on top of a modified version of the Lucene library, which handles document retrieval. To support accurate retrieval, we have implemented the Okapi (BM25) formulation for measuring document-query similarity. Three query expansion methods are studied and implemented in the attached software: query expansion based on pseudo relevance feedback [1], query expansion using documents returned by the Google search engine, and query expansion using the synonym sets defined by WordNet. The rest of the document is organized as follows: Section 2 describes the Okapi (BM25) formulation for document-query similarity, Section 3 describes the different approaches to query expansion, and Section 4 presents the evaluation results for the developed approaches.
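As a rough illustration of how BM25 scoring and pseudo relevance feedback fit together, the following is a minimal sketch in plain Java. It is not the author's implementation and does not use the Lucene API. The class name `PrfSketch`, the parameter values `K1 = 1.2` and `B = 0.75`, and the choice to pick expansion terms by raw frequency in the top-ranked documents are all assumptions made for this example.

```java
import java.util.*;

// Hypothetical sketch: BM25 ranking followed by pseudo relevance feedback.
// Documents and queries are pre-tokenized lists of terms.
class PrfSketch {
    // Commonly used BM25 defaults; assumed here, not taken from the paper.
    static final double K1 = 1.2, B = 0.75;

    // Smoothed inverse document frequency (always positive).
    static double idf(int numDocs, int docFreq) {
        return Math.log(1.0 + (numDocs - docFreq + 0.5) / (docFreq + 0.5));
    }

    // BM25 score of one document for a query, with statistics from the corpus.
    static double bm25(List<String> query, List<String> doc, List<List<String>> corpus) {
        double avgLen = corpus.stream().mapToInt(List::size).average().orElse(1.0);
        double score = 0.0;
        for (String term : new LinkedHashSet<>(query)) {
            int tf = Collections.frequency(doc, term);
            if (tf == 0) continue;
            int df = 0;
            for (List<String> d : corpus) if (d.contains(term)) df++;
            double norm = tf + K1 * (1 - B + B * doc.size() / avgLen);
            score += idf(corpus.size(), df) * tf * (K1 + 1) / norm;
        }
        return score;
    }

    // Treat the topDocs highest-scoring documents as relevant and append the
    // newTerms most frequent terms from them that are not already in the query.
    static List<String> expand(List<String> query, List<List<String>> corpus,
                               int topDocs, int newTerms) {
        List<List<String>> ranked = new ArrayList<>(corpus);
        ranked.sort((a, b) -> Double.compare(bm25(query, b, corpus), bm25(query, a, corpus)));

        Map<String, Integer> counts = new HashMap<>();
        for (List<String> d : ranked.subList(0, Math.min(topDocs, ranked.size())))
            for (String t : d)
                if (!query.contains(t)) counts.merge(t, 1, Integer::sum);

        // Most frequent first; ties broken alphabetically for determinism.
        List<String> candidates = new ArrayList<>(counts.keySet());
        candidates.sort((a, b) -> {
            int c = Integer.compare(counts.get(b), counts.get(a));
            return c != 0 ? c : a.compareTo(b);
        });

        List<String> expanded = new ArrayList<>(query);
        expanded.addAll(candidates.subList(0, Math.min(newTerms, candidates.size())));
        return expanded;
    }
}
```

In practice, expansion terms are usually weighted by a relevance-based score rather than raw frequency, and the expanded query is then re-run against the index; the sketch leaves both steps out for brevity.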

Similar resources

Query expansion based on relevance feedback and latent semantic analysis

Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...

Rutgers Information Interaction Lab at TREC 2005: Trying HARD

Within the structure of the TREC 2005 HARD track guidelines, we investigated the following hypotheses: H1: Query expansion using a “clarity”-based approach will increase effectiveness over baseline queries and baseline queries plus pseudo-relevance feedback; H2: Query expansion based on the Web will increase effectiveness over baseline queries and baseline queries plus pseudo-relevance feedback...

Query Expansion Strategy based on Pseudo Relevance Feedback and Term Weight Scheme for Monolingual Retrieval

Query Expansion using Pseudo Relevance Feedback is a useful technique for reformulating the query. In this paper, expansion terms are obtained by combining pseudo relevance feedback and equi-frequency partition of the documents with tf-idf scoring technique. It is observed that the groups of words that have same tf-idf score as that of query terms are better candidate words for query expansion ...

Pseudo-Relevance Feedback Driven for XML Query Expansion

Pseudo-relevance feedback has been perceived as an effective solution for automatic query expansion. However, a recent study has shown that traditional pseudo-relevance feedback may cause topic drift and hence harm retrieval performance. It is often crucial to identify those good feedback documents from which useful expansion terms can be added to the query. Compared with trad...

Query Expansion based on Pseudo Relevance Feedback from Definition Clusters

Query expansion consists in extending user queries with related terms in order to solve the lexical gap problem in Information Retrieval and Question Answering. The main difficulty lies in identifying relevant expansion terms in order to prevent query drift. We propose to use definition clusters built from a combination of English lexical resources for query expansion. We apply the technique of...

NTCIR-5 Query Expansion Experiments using Term Dependence Models

This paper reports the results of our experiments performed for the Query Term Expansion Subtask, a subtask of the WEB Task, at the Fifth NTCIR Workshop, and the results of our further experiments. In this paper we mainly investigated: (i) the effectiveness of query formulation by composing or decomposing compound words and phrases of the Japanese language, which is based on a theoretical frame...

Publication date: 2012